Reinforcement learning produces dominant strategies for the Iterated Prisoner’s Dilemma

نویسندگان

Marc Harper

Vincent A. Knight

Martin Jones

Georgios Koutsovoulos

Nikoleta E. Glynatsi

Owen Campbell

چکیده

We present tournament results and several powerful strategies for the Iterated Prisoner's Dilemma created using reinforcement learning techniques (evolutionary and particle swarm algorithms). These strategies are trained to perform well against a corpus of over 170 distinct opponents, including many well-known and classic strategies. All the trained strategies win standard tournaments against the total collection of other opponents. The trained strategies and one particular human made designed strategy are the top performers in noisy tournaments also.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Cross Entropy Method for the N-Persons Iterated Prisoner’s Dilemma

We apply the Cross-entropy method to the N persons Iterated Prisoners Dilemma and show that cooperation is more readily achieved than with existing methods such as genetic algorithms or reinforcement learning.

متن کامل

Opponent Modelling and Strategy Evolution in the Iterated Prisoner’s Dilemma

Learning and evolution are two adaptive processes in the natural world that have been modelled in the study of artificial intelligence in computer science. In both biology and in artificial intelligence, learning and evolution are complementary processes. The nature of the interactions between learning and evolution has been the subject of much research in scientific disciplines. Evolution of a...

متن کامل

Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach

The Iterated Prisoner’s Dilemma has guided research on social dilemmas for decades. However, it distinguishes between only two atomic actions: cooperate and defect. In real-world prisoner’s dilemmas, these choices are temporally extended and different strategies may correspond to sequences of actions, reflecting grades of cooperation. We introduce a Sequential Prisoner’s Dilemma (SPD) game to b...

متن کامل

Role of Iterated Prisoner’s Dilemma in Genetic Based Machine Learning

Several strategies have been followed by most of earlier researchers in the field of machine learning. Agarwal has connected Machine Learning with Iterated Prisoner’s Dilemma Problem [IPD]. Holland has proposed basic directions to explore goal of genetic operators in the study of machine learning. Axelrod connected Genetic Algorithm with IPD. We integrate these basic approaches to give a novel ...

متن کامل

An Experimental Study of N-Person Iterated Prisoner's Dilemma Games

The Iterated Prisoner’s Dilemma game has been used extensively in the study of the evolution of cooperative behaviours in social and biological systems. There have been a lot of experimental studies on evolving strategies for 2-player Iterated Prisoner’s Dilemma games (2IPD). However, there are many real world problems, especially many social and economic ones, which cannot be modelled by the 2...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 12 شماره

صفحات -

تاریخ انتشار 2017

Reinforcement learning produces dominant strategies for the Iterated Prisoner’s Dilemma

نویسندگان

چکیده

منابع مشابه

The Cross Entropy Method for the N-Persons Iterated Prisoner’s Dilemma

Opponent Modelling and Strategy Evolution in the Iterated Prisoner’s Dilemma

Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach

Role of Iterated Prisoner’s Dilemma in Genetic Based Machine Learning

An Experimental Study of N-Person Iterated Prisoner's Dilemma Games

عنوان ژورنال:

اشتراک گذاری